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ARTIFICIAL PROMOTER LIBRARIES FOR SELECTED ORGANISMS AND PROMOTERS DERIVED FROM SUCH 
LIBRARIES 



(57) Abstract 

An artificial promoter library for a selected organism or group of organisms is constructed as a mixture of double stranded DNA 
fragments, the sense strands of which comprise at least two consensus sequences of efficient promoters from said organism or group of 
organisms, or parts thereof comprising at least half of each, and surrounding or intermediate nucleotide sequences (spacers) of variable 
length in which at least 7 nucleotides are selected randomly among the nucleobases A, T, C and G. The sense strands of the double 
stranded DNA fragments may also include a regulatory DNA sequence imparting a specific regulatory feature, such as activation by a 
change in the growth conditions, to the promoters of the library. Further, they may have a sequence comprising one or more recognition 
sites for restriction endonuc leases added to one of or both their ends. The selected organism or group or organisms may be selected from 
prokaryotes and from eukaryotes; and in prokaryotes the consensus sequences to be retained most often will comprise the -35 signal (-35 
to -30): TTGACA and the -10 signal (-12 to -7): TATAAT or parts of both comprising at least 3 conserved nucleotides of each, while in 
eukaryotes said consensus sequences should comprise a TATA box and at least one upstream activation sequence (UAS). Such artificial 
promoter libraries can be used i.a, for optimizing the expression of specific genes in various selected organisms. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


PI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


C.E 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


Gil 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Ceniral African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cdte d* [voire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 98/07846 



1 



PCI7DK97/00342 



Artificial promoter libraries for selected organisms and 
promoters derived from such libraries 



FIELD OF THE INVENTION 

5 

This invention concerns artificial promoter libraries and 
a method of constructing an artificial promoter library 
for a selected organism or group of organisms. The inven- 
tion also concerns the individual novel promoters derived 

10 from such libraries. Further, the invention concerns a 
method of optimizing the expression of a gene in a se- 
lected organism by use of promoters from such an artifi- 
cial promoter library for that organism. In principle , 
artificial promoter libraries according to the invention 

15 can be constructed for use in any living organism, but 
presently they have mostly been of value for modulating 
gene expression of microorganisms. In connection with 
this invention the term "microorganism" shall be taken 
broadly to include prokaryotic organisms such as bacteria 

20 as well as eukaryotic microorganisms such as yeasts, 
other fungi and cell lines of higher organisms. 

BACKGROUND OF THE INVENTION 

25 Metabolic engineering of living organisms is still in its 
infancy with respect to industrial applications, de- 
spite the fact that genetic engineering has now been fea- 
sible for more than a decade. To a large extent, this may 
be due to the disappointing outcome of many of the at- 

30 tempts so far to improve strain performance. There are at 
least two reasons for the negative outcome of the at- 
tempts to increase metabolic fluxes: 

One is that the genetic engineer tends to overlook the 
35 subtlety of control and regulation of cellular metabo- 
lism. The expression of enzymes that are expected to be 
rate limiting are increased 10 to 100 fold, e.g. by 
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placing the gene on a high copy number plasmid. Or, a 
branching flux in a pathway is eliminated by deleting a 
gene. Quite often, this will have secondary effects on 
the metabolism, for instance by lowering metabolite con- 
5 centrations that are essential to other parts of the cel- 
lular metabolism (e.g. processes that are essential to 
the growth of the organsim) and the net result may be 
that the overall performance of the cell with respect to 
the desired product is decreased. Instead, it is neces- 
10 sary to tune the expression of the relevant gene around 
the normal expression level and determine the optimal ex- 
pression level , for instance as the level that maximizes 
or minimizes the flux. 

15 The second reason for the negative outcome lies in the 
rate limiting concept itself: both metabolic control 
theory (Kacser and Burns, 1973) and experimental determi- 
nations of control by individual steps in a pathway 
{Schaaff et a 1 . , 1989? Jensen et al. r 1993) have shown 

20 that reaction steps which were expected to be rate lim- 
iting with respect to a particular flux, turned out to 
have no or very little control over the flux. Instead, 
the control and regulation of the cellular metabolism 
turned out to be distributed over several enzymes in a 

25 pathway, and it may be necessary to enhance the expres- 
sion of several enzymes in order to obtain a higher flux. 

According to metabolic control theory, the total flux 
control exerted by all the enzymes in a pathway, should 
30 always sum up to 1, Therefore, after one enzyme concen- 
tration has been optimized, the flux control will have 
shifted to another enzyme(s), and it may then be useful 
to perform additional rounds of enzyme optimization in 
order to increase the flux further. 

35 

In summary, flux optimization requires 1) fine-tuning of 
enzyme concentration rather than many fold overexpression 
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and often 2) optimization of the level of several enzymes 
in a pathway rather than looking for the rate limiting 
step, 

5 There are now many systems available that allow one to 
increase the gene expression more than 1000 fold and/or 
to turn on gene expression at a particular time point 
during a fermentation process (e.g. using temperature in- 
ducible systems). With respect to tuning the steady state 

10 gene expression in the fermenter, to say 150% or 70% of 
the normal expression level, it becomes more difficult. 
In principle, one could use a iac-type promoter in front 
of the gene of interest, and then add a certain amount of 
an inducer of the iac system, for instance IPTG 

15 ( isopropyl-fi-D-thiogalactoside) , or use a temperature 
sensitive system at the correct temperature. These possi- 
bilities are often not practical for large scale indus- 
trial applications. The alternative is to use a promoter 
that has exactly the right strength. However, such pro- 

20 moters are seldom available, and furthermore one needs a 
range of promoter activities in order to optimize the ex- 
pression of the gene in the first place, see below. 

During the past two decades, much work has been done to 
25 define and optimize the consensus sequences of microor- 
ganisms. In many prokaryotes, one often finds two more or 
less conserved DNA sequences at approximately position - 
10 and -35 relative to the start site for transcription, 
TATAAT and TTGACA, respectively, with aproximately 17 
30 basepair between the two. The dogma in this field is 
that, by including these elements, the resulting promot- 
ers would tend to become strong. Indeed, promoter up mu- 
tations, which are relatively rare events, ususally re- 
sults in a better match to the above consensus sequences, 
35 while down mutations results in a poorer match to the 
consensus sequences or a less optimal distances between 
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these. In addition , when random DNA sequences are cloned 
in place of one of the two consensus sequences, the 
strength of the resulting promoters usually correlate 
with the degree of homology to the consensus sequences. 

5 

In principle, modulation of the strength of promoters 
could then be achieved by basepair changes in the consen- 
sus sequences or by changes in the length of the spacer 
between these. But the impact of such changes on the pro- 
10 moter strength will tend to be large (see example 1 of 
this invention), and it is therefore not feasible to 
achieve small steps of strength modulation through base- 
pair changes in the consensus sequences . 

15 While the length of the spacer separating the two consen- 
sus sequences is known to play an important role for the 
strength of a promoter, the sequence of the spacers be- 
tween the consensus sequences has usually been considered 
to be of little importance for the strength of the pro- 

20 moters, and attempts to identify additional consensus se- 
quences in the spacer region through mutagenesis have in- 
deed been unsuccesful. So far f nobody has attempted to 
randomize the spacer, while keeping the consensus se- 
quences and the spacer length relatively constant. 

25 

Numerous experiments have been carried out in order to 
define and optimize the consensus sequences of micro- 
organisms, including experiments where at least one of 
the consensus sequences was being randomized. In some of 

30 these experiments a part of the nucleotides surrounding 
the consensus sequences was also randomized in order to 
allow for the generation of promoters with length of 
spacers different from 17 bp, and/or in order to find 
possible new consensus motifs around the consensus se- 

35 quences. The chances that this will generate an efficient 
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promoter is very small and a selection must be applied in 
order to find those rare cases where the homology to the 
consensus sequence is sufficiently high to result in even 
a weak promoter. 

5 

OBJECT OF THE INVENTION 

The promoter libraries that we are aiming at should cover 
the entire range of promoter activities that could become 

10 of interest for engineering of a particular species, 
preferably from the weakest promoters detectable, to the 
strongest promoters possible. Moreover, we aim at cover- 
ing this broad activity range in small steps, say in- 
crease in activity by 50-100% per step in order to be 

15 suitable for the purpose of flux optimization as de- 
scribed above. 

In this invention, we show that the sequence of the spac- 
ers between the consensus sequences are far more impor- 

20 tant than it has previously been appreciated. The spacer 
sequence can have a strong impact on promoter strength, 
when 1) a major part of the spacer sequence is varied si- 
multaneously and in a random manner and 2), at the same 
time, at least half of the consensus sequences are kept 

25 constant. We show that if these two conditions are ful- 
filled, our method can be used to generate promoters that 
cover wide ranges of activities, including very strong 
promoters. The range of promoter activities is covered in 
small steps of activity change which makes these promot- 

30 ers very suitable for metabolic engineering purposes. In 
addition, we show that the method can be used to generate 
promoters for a wide range of organisms and at an unusual 
high frequency. 
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SUMMARY OF THE INVENTION 

The present invention provides an artificial promoter li- 
brary for a selected organism or group of organisms, com- 
5 prising a mixture of double stranded DNA fragments the 
sense strands of which comprise at least two consensus 
sequences of efficient promoters from said organism or 
group of organisms, or parts thereof comprising at least 
half of each, and surrounding or intermediate nucleotide 
10 sequences (spacers) of variable length in which at least 
7 nucleotides are selected randomly among the nucleobases 
A, T, C and G, with the proviso that previously known 
promoter sequences and promoter sequences isolated from 
natural sources are not comprised. 

15 

The broadest variation in promoter strengths is obtained 
when at least 10, preferably at least 12, and more pref- 
erably at least 14, nucleotides in the spacer sequence(s) 
are selected randomly among the nucleobases A, T, C and 
20 G. 

The sense strands of the double stranded DNA fragments 
may also include a regulatory DNA sequence imparting a 
specific regulatory feature to the promoters of the li- 
25 brary. Such specific regulatory feature is preferably ac- 
tivation by a change in the growth conditions , such as a 
change in the pH, osmolarity , temperature or growth 
phase . 

30 For cloning purposes the double stranded DNA fragments 
usually have sequences comprising one or more recognition 
sites for restriction endonucleases added to one of or 
both their ends; most conveniently sequences specifying 
multiple recognition sites for restriction endonucleases 

35 (multiple cloning sites MCS) . 



WO 98/07846 



7 



PCT/DK97/00342 



The selected organism or group of organisms may be se- 
lected from prokaryotes and from eukaryotes, in particu- 
lar from prokaryotes and eukaryotic microorganisms such 
as yeasts, other fungi and cell lines of higher organ- 
5 isms . 

An interesting group of prokaryotes i.a. in the dairy in- 
dustry consists of lactic acid bacteria of the genera 
Lactococcus , Streptococcus , Enterococcus , Lactobacillus 

10 and Leuconostoc f in particular strains of the species 
Lactococcus lactis and Streptococcus thermophilus . Other 
interesting prokaryotes are bacteria belonging to the 
genera Escherichia, Bacillus and Pseudomonas, in particu- 
lar the species Escherichia coli f Bacillus subtilis and 

15 Pseudomonas putida. 

In an artificial promoter library for prokaryotes said 
consensus sequences may for example comprise the -10 sig- 
nal (-12 to -7): TATAAT and at least one activator pro- 
20 tein binding site upstream of the -10 signal or parts 
thereof comprising at least 3 conserved nucleotides of 
each. 

Most often the consensus sequences to be retained in an 
25 artificial promoter library for prokaryotes will comprise 
the -35 signal (-35 to -30): TTGACA and the -10 signal (- 
12 to -7): TATAAT or parts of both comprising at least 3 
conserved nucleotides of each, 

30 More efficient promoters are usually obtained when said 
consensus sequences comprise from 4 to 6 conserved nu- 
cleotides of the -35 signal and from 4 to 6 conserved nu- 
cleotides of the -10 signal, preferably 5 or 6, and more 
preferably all 6 nucleotides of each. The most efficient 

35 promoters are obtained when said consensus sequences fur- 
ther comprise intervening conserved motifs, e.g. selected 
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from the conserved motifs -44 to -41: AGTT, -40 to -36: 
TATTC, -15 to -14: TG, and +1 to +8: GTACTGTT . 

In such promoters the length of the spacer between the 
5 35 signal and the -10 signal should be 14-23 bp, prefera- 
bly 16-18 bp, and more preferably 17 bp. This should be 
understood to mean the spacer length between the hexamer 
signals, even when some of the nucleotides in the signals 
have been mutated. 

10 

In eukaryotic organisms said consensus sequences should 
comprise a TATA box and at least one upstream activation 
sequence (UAS) . 

15 An interesting eukaryotic microorganism is the yeast spe- 
cies Saccharomyces cerevisiae, normal baker's yeast. In 
promoters to be used in Saccharomyces the consensus se- 
quences may further comprise a transcription initiation 
signal (TI box) functioning in Saccharomyces cerevisiae. 

20 

In a specific embodiment of an artificial promoter li- 
brary according to the invention for Saccharomyces cere- 
visiae said consensus sequences comprise the TATA box: 
TAT AAA, the UAScCN4p- TGACTCA, and the TI box: 
25 CTCTTAAGTGCAAGTGACTGCGA, which also functions as the 
binding site for the arginine repressor, argR. 

The individual promoters of the artificial promoter li- 
braries defined above are also comprised by the inven- 

30 tion. Specific promoters which have been constructed ac- 
cording to the following examples are those stated in the 
SEQ IDs Nos. 5 to 58 below. The invention further com- 
prises artificial promoters which are derived from pro- 
moters defined by the artificial promoter libraries of 

35 the invention. 
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The present invention also provides a method of con- 
structing an artificial promoter library for a selected 
organism or group of organisms, which comprises selecting 
at least two consensus sequences of efficient promoters 
5 from said organism or group of organisms; synthesizing a 
mixture of single stranded DNA sequences comprising said 
consensus sequences, or parts thereof comprising at least 
half of each, and surrounding or intermediate nucleotide 
sequences (spacers) of variable length in which at least 
10 7 nucleotides are selected randomly among the nucleobases 
A, T, C and G; and converting the single stranded DNA se- 
quences into double stranded DNA fragments by second 
strand synthesis. 

15 As previously mentioned, the broadest variation is ob- 
tained when at least 10, preferably at least 12 , and more 
preferably at least 14, nucleotides in the spacer se- 
quence^) are selected randomly among the nucleobases A, 
T, C and G. 

20 

In order to obtain an artificial promoter library which 
is susceptible to regulation, the single stranded DNA se- 
quences which are synthesized may include a regulatory 
DNA sequence imparting a specific regulatory feature to 
25 the promoters of the library. Such specific regulatory 
feature is preferably activation by a change in the 
growth conditions, such as a change in the pH, osmolar- 
ity, temperature or growth phase. 

30 Also, in order to obtain an artificial promoter library 
suitable for cloning , a sequence specifying one or more 
recognition sites for restriction endonucleases may be 
added to one of or both the ends of the single stranded 
DNA sequences in the synthesis, or a linker comprising 

35 such restriction sites may be ligated to one of or both 
the ends of the double stranded DNA fragments. Most con- 
veniently such sequences specify multiple recognition 
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sites for restriction endonucleases (multiple cloning 
sites MCS) . 

The selected organisms for which artificial promoter li- 
5 braries can be prepared by the method according to the 
invention and the various degenerated sequences to be 
chosen for the promoter libraries of specific organisms 
are the same as discussed above for the artificial pro- 
moter libraries per se, 

10 

With respect to possible uses of the artificial promoter 
libraries described above , the invention further provides 
a method of optimizing the expression of a gene in a mi- 
croorganism, which comprises 

15 

a) selecting a set of promoters covering a range of pro- 
moter activities in relatively small steps of activity 
change from an artificial promoter library according to 
any one of claims 1-26 or constructed by the method ac- 

20 cording to any one of claims 30-55; 

b) cloning said set of promoters into said organism plac- 
ing in each clone said gene under the control of at least 
one promoter from the set; 

25 

c) growing the selected clones and screening them to find 
the one showing optimized flux of product formation. 

This method is preferably used with organisms selected 
30 from the group consisting of prokaryotic and eukaryotic 
microorganisms such as bacteria, yeasts, other fungi and 
mammalian cell lines. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 .. A library of artificial promoters for L. lac- 
tis, from example 1. The promoter activities (Miller 
5 units) were assayed from the expression of a reporter 
gene (lacLM) encoding b-galactosidase transcribed from 
the different synthetic promoter clones, on the promoter 
cloning vector pAK80, in strain MG1363, grown in GM17 me- 
dium supplemented with 2 ng/ml erythromycin. The patterns 
10 of the data points indicate which promoter clones dif- 
fered in either the -35 or the -10 consensus sequence, or 
in the length of the spacer between these two consensus 
sequences . 

15 Figure 2. A library of artificial heatshock regulated 
promoters for L. lactis, from example 2. The promoter ac- 
tivities (Miller units) were assayed from the expression 
of a reporter gene (gusA) encoding P-glucuronidase , tran- 
scribed from the different synthetic promoter clones, on 

20 the chromosome in strain LB436. The cells were grown in 
GM17 medium supplemented with 2 |ig/ml erythromycin, and 
at two different temperatures, 30 and 37 °C. The asssay 
for p-glucuronidase was parallel to the p-galactosidase 
assay (see example 1) except that X-gluc was used as the 

25 substrate. 

Figure 3. The library of artificial promoters from exam- 
ple l r assayed for activity in E. coli. The promoter ac- 
tivities (Miller units) were assayed from the expression 

30 of a reporter gene (lacLM) encoding P-galactosidase tran- 
scribed from the different synthetic promoter clones, on 
the promoter cloning vector pAK80, in strain BOE2 70, 
grown in LB medium supplemented with 200 ng/ml erythromy- 
cin. The patterns of the data points indicate which pro- 

35 moter clones differed in either the -35 or the -10 con- 
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sensus sequence , or in the length of the spacer between 
these two consensus sequences . 

Figure 4- A library of artificial promoters for 5. cere- 
5 visiae, from example 7. The promoter activities (Miller 
units) were assayed from the expression of a reporter 
gene (lacZ) encoding P-galactosidase transcribed from the 
different synthetic promoter clones, on the promoter 
cloning vector pYLZ-2 in S. cerevisiae strain SG24 (URA3- 
10 52), grown in SD minimal medium supplemented with 2% glu- 
cose* YP24 and YP435 has a 1 bp deletion and a point mu- 
tation, respectively, in the UASccN4p binding site. 
pTKlOl contains a promoter in which the UAS sequence has 
been deleted and the TATA box is present. 

15 

Figure 5. Regulation of artificial yeast promoters by ex- 
ternal arginine, from example 7. The promoter activities 
(Miller units) were assayed from the expression of a re- 
porter gene (lacZ) encoding P-galactosidase transcribed 

20 from the different synthetic promoter clones, on the pro- 
moter cloning vector pYLZ-2 in S. cerevisiae strain SG24 
(URA3-52), grown in SD minimal medium supplemented with 
2% glucose, with (SDG) or without 1 g/1 arginine 
(SDG+arg). YP183 has a 13 bp deletion in the binding site 

25 for the argR repressor. 

Figure 6, Regulation of artificial yeast promoters by ex- 
ternal amino acids, from example 7. The promoter activi- 
ties (Miller units) were assayed from the expression of a 
30 reporter gene (lacZ) encoding P-galactosidase transcribed 
from the different synthetic promoter clones, on the pro- 
moter cloning vector pYLZ-2 in S. cerevisiae strain SG24 
(URA3-52), grown in SD minimal medium supplemented with 
2% glucose (SDG) or complex medium (containing amino ac- 
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ids) without uracil (SC-ura) , YP24 has a 1 bp deletion in 
the UASQCN4p binding site. 

DETAILED DESCRIPTION OF THE INVENTION 

5 

In our method, degenerated oligonucleotides are synthe- 
sized for the organism or group of organisms for which a 
promoter library should be constructed. The sequences of 
the oligonucleotides are written by combining the avail- 

10 able knowledge from the literature, on the features that 
makes a promoter function efficiently in that particular 
organism. The amount of information that needs to be 
fixed in the oligonucleotide is somewhat variable among 
different organisms. In E. coli for instance, promoters 

15 of considerable strength may be formed by less perfect 
matches in the -35 and -10 consensus sequences and by 
spacing between these sequences deviating from 17bp 
( TTGACA and TATAAT respectively), whereas the require- 
ments for strong promoters in L. lactis appear to be more 

20 strict. 

Secondly, the single stranded oligonucleotides are con- 
verted into double stranded DNA fragments and cloned into 
a promoter probing vector. The promoter-containing clones 

25 are identified e.g. by their ability to give colonies 
with various extents of colour on indicator plates. This 
should in principle give us only very strong promoters, 
but we discovered that by allowing the spacer sequences 
between the consensus sequences to vary in a random man- 

30 ner, the strength of the resulting promoters are modu- 
lated. In fact, using this method we obtained promoter 
libraries, spanning the entire range of promoter activi- 
ties that is likely to become of interest, in small steps 
of activity change. 
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Optimization of gene expression could then be achieved as 
follows: 1) From the promoter library one chooses promot- 
ers that have e.g. 25%, 50%, 100%, 200% and 400% of the 
strength of the wild-type promoter. 2) Then, these pro- 
5 moters are cloned in place of the wild-type promoter up- 
stream of the gene of interest. 3) The magnitude of the 
variable to be optimized (e.g. the flux through a path- 
way) that is obtained with each of these five constructs 
is then measured and the optimal construct is used di- 

10 rectly as the production organism. It may be necessary to 
fine-tune the expression further or to expand the range 
of promoter activities. A direct advantage of this system 
over the inducible systems described above is that once 
the optimal promoter activity has been determined, the 

15 strain is in principle ready for use in the fermentation 
process . 

In one preferred embodiment the random spacers method of 
the invention is used for generating a series of consti- 

20 tutive promoters for the Gram-positive bacterium, Lacto- 
coccus lactis. In other preferred embodiments we show 
that promoters generated by the random spacers method of 
the invention are functional in at least two species of 
Gram-negative bacteria, Escherichia coli and Pseudomonas, 

25 as well as in the Gram-positive bacterium Bacillus sub- 
tilis. We also show that the strength of the individual 
promoter is dependent on which organism it is being used 
in, i.e. that in some organisms a particular promoter is 
strong, in others it is weak, but in all the organisms 

30 tested, the promoters cover a broad range of activities, 
in small steps of activity change. 

Often it is desirable to activate gene expression to a 
certain extent and at a certain stage of a fermentation, 
35 e.g. because the gene product that is expressed inhibits 
the growth of the cells. It is then useful to combine the 
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above technique for obtaining promoters of different 
strength with some regulatory mechanism, e.g. so that the 
promoter will be activated by a change in the pH f tem- 
perature or growth phase. 

5 

Thus, in another preferred embodiment the random spacers 
method of the invention is used for generating a series 
of specifically regulated promoters. As illustrated in 
Example 2, the above approach is used in combination with 
10 specific regulatory DNA sequences to generate a library 
of heatshock-regulated promoters for the Gram-positive 
bacterium, Lactococcus lactis. 

In addition to prokaryotes, eukaryotic microorganisms 
15 (yeast and other fungi as well as mammalian cell lines,) 
are important microorganisms for production of a range of 
organic compounds and various proteins. It is therefore 
of interest to develop the above approach for modulating 
gene expression in these organisms as well. Thus, in yet 
20 another preferred embodiment, as illustrated in Example 
7, the random spacers method of the invention is used for 
generating a series of promoters for the bakers yeast, 
Saccharomyces cerevisiae. The promoters are here equipped 
with GCN4p and ARGR regulation. 

25 

The regulation of transcription initiation in the eukary- 
otic cell is somewhat more complex compared to the pro- 
karyote. The transcription start site is normally pre- 
ceded by a so-called TATA box that contains the consensus 

30 sequence TATAAA or parts hereof, but unlike in the pro- 
karyote, the distance from the TATA box to the transcrip- 
tion start site is much less defined. In Saccharomyces 
cerevisiae this distance is typically 40-120 nucleotides 
(Oliver and Warmington, 1989). The so-called -35 consen- 

35 sus hexamer which is found in many prokaryotic promoters 
is absent in Saccharomyces cerevisiae. Instead so-called 
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upstream activation sequences (UAS) are found upstream of 
the transcription initiation site. These UAS are recog- 
nised by specific DNA binding proteins that can then act 
as activators of transcription initiation. For instance, 
5 the UAS sequence that is found upstream of the genes in- 
volved in aminoacid biosynthesis, UASQCN4pf consists of a 
DNA sequence that specifies a binding site for the GCN4 
protein, which activates the transcription of these genes 
(Hinnebusch, 1992). In contrast to prokaryotes, where the 
10 distance between the promoter elements appears to be 
critical for the strength of promoters (see example 1), 
the distance between the TATA box and the UAS sequence in 
eukaryotic promoters is highly variable and may be up to 
about 1000 bp* Some genes even contain more than one copy 
15 of the UAS, but one seems to be sufficient for activa- 
tion. 

One of the UAS sequences known from yeast is the binding 
site for the GCN4 protein. Promoters that contain the 
binding site for GCN4 protein should be regulated by the 
status of amino acid supply in the growth medium: in the 
absence of amino acids, the GCN4 protein is formed and 
binds specific UAS sequences to stimulate transcription 
at promoters involved in biosynthesis of aminoacids. The 
consensus sequence for the GCN4 protein binding site 
(UASGCN4p) is a short inverted repeat, TGACTCA. 

A promoter in Saccharomyces cerevisiae that is activated 
by the GCN4 protein is the ARG8 promoter. In this pro- 
30 moter, there is only one copy of the UASccN4p sequence, 
and it is located 59 bp from the TATA box (we refer to 
this distance as spacer 1). Transcription initiation 
takes place some 40-60 nucleotides downstream of the TATA 
box. The ARG8 promoter also contains a DNA sequence that 
35 functions as the binding site for the arginine repressor, 
argR (Crabeel et ai., 1995), which makes the promoter 
four fold repressible by external arginine. 



20 



25 



WO 98/07846 



PCI7DK97/00342 



17 

Thus, in this case the promoter is located within 136 bp, 
and it contains two regulatory features, which makes the 
system attractive for developing a promoter library by 
the random spacers method of the invention as outlined in 
5 the previous examples. But the method is not limited to 
this model system; in principle, any combinations of TATA 
boxes, UAS sequences, repressor binding sites etc., sepa- 
rated by spacers smaller than about 1000 nucleotides, 
should be suitable as a starting point for this method. 

10 

EXAMPLE 1 

Design of a degenerated oligonucleotide for a L. lactis 
promoter library. 

15 

According to the literature (see review in de Vos & Si- 
mons, 1994), strong promoters in L. lactis tend to have 
the following nucleotide sequences in common (numbers re- 
fer to the position relative to the transcription initia- 

20 tion site, which is given number +1): -12 to -7: TATAAT; 
-15 to -14: TG; -35 to -30: TTGAC A . The spacing between - 
10 and -35 seems to be 17 nucleotides. However, closer 
comparison of the promoter sequences that have been pub- 
lished for L. lactis reveals that in a number of posi- 

25 tions besides the ones mentioned above, nucleotides are 
more or less well conserved. Some of these positions are: 
-1: A; -3: A or T (=W); -6: A; -13: A or G (=R); -40 to - 
36: TATTC. In addition, Nilsson and Johansen (1994, BBA) 
pointed out two motives, +1 to +8: GTACTGTT, and -44 to - 

30 41: AGTT, that appear to be well conserved between rela- 
tively strong promoters (promoters for transfer RNA and 
ribosomal RNA operons) from L. lactis. These motives may 
confer both strength and growth rate dependent expression 
from the promoter. 

35 

When these additional motives are included, one arrives 
at the following 53 nucleobase degenerated sequence for 
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an efficient promoter in L. lactis. Out of these 53 nu- 
cleobases, 34 bases are conserved, two are semi-conserved 
(R and W) and 17 are allowed to vary randomly between the 
four nucleobases. 



5 ' AGTTTATTC TTGACAN NNNNNNNNNNNNNTGRTATAATANNWN AGTACTGTT 3' 



In addition , this degenerated sequence is flanked by se- 
quences that specify multiple recognition sites for re- 
JO striction endonucleases (multiple cloning site MCS), and 
the actual oligonucleotide mixture to be synthesized has 
the following degenerated sequence reported in SEQ ID No. 



Mbol 
Dpnl 
Alwl 
NlalV 
20 BstYI 

BamHI Msel 

Alwl Aflll Sspl Nsil 
1 ..... 
5 ' CGGGATCCTTAAGAATATTATGCATNNNNN AGTT TATTC TTGACA NNNNNNNKNNNNNNT 

25 

Alul 
PvuII 
NspBII 
Sfcl 

30 Msel Fnu4HI 

Rsal Hpal PstI 
Seal Hindi Bbvl EcoRI 
61 100 
G R TATAATA NNWN AGT ACT GTTA ACTG CAGCT GAATT C GG 3 ' 



A mixture of oligonucleotides according to this specifi- 
cation was synthesized by Hobolth DNA synthesis. 



This oligonucleotide mixture is single stranded initially 
40 and must be converted into double stranded DNA fragments 
for the purpose of cloning. This was done by synthesizing 
in vitro a 10 bp oligonucleotide, having a sequence com- 
plementary to the 3' end of the promoter oligonucleotide. 
This oligonucleotide was then used as a primer for second 
45 strand synthesis by the Klenow fragment of DNA polymerase 
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I in the presence of dATP, dCTP, dGTP and dTTP. In this 
way the second DNA strand became exactly complementary to 
the first DNA strand. 

5 The result is a mixture of 100 bp double stranded DNA 
fragments containing multiple recognition sites for re- 
striction endonucleases in both the 3' and 5' end. These 
DNA fragments were then cut with restriction endonucle- 
ases in order to create suitable "sticky" ends compatible 

10 with the ends of the vector DNA fragment, pAK80 
(Israelsen et al. t 1995). pAK80 is a shuttle vector, 
meaning that it has replication origins for propagation 
in both E. coli and L. lactis. In this way, the cloning 
steps can be conveniently performed in E. coli, while the 

15 subsequent physiological experiments can be done in L. 
lactis. Furthermore, pAK80 carries a promoterless (3- 
galactosidase reporter gene system (lacLM) downstream a 
multiple cloning site. Thus, pAK80 does not express the 
lacLM genes, unless a promoter is inserted in the multi- 

20 pie cloning site. 

Two cloning strategies were used for cloning the mixture 
of double stranded DNA fragments into the cloning vector 
pAK80: 

25 

1) The mixture was digested with BamHI and PstI and the 
vector pAK80 with PstI and Bglll (Bglll is compatible 
with BamHI) , 

30 2) The mixture was digested with Sspl and Hindi and the 
vector pAK80 with Smal (all three enzymes produce blunt 
end DNA fragments). 

In both cases the vector DNA was further treated with 
35 Calf Intestine Phosphatase (CIP) to prevent religation of 
the cloning vector. Subsequently, the fragment mixture 



WO 98/07846 



PCT/DK97/00342 



,20 

and vector DNA were ligated overnight at 16 °C using T4 
DNA ligase and standard ligation conditions. 

Ligation mixtures were transformed into E. coli K-12 
5 Mac, with selection for erythromycin resistance. Cells 
of the E. coli K-12 strain, MT102, were made competent 
using standard treatment with Ca ++ ions (Maniatis et al., 
1982). Ligation mixtures were then transformed into these 
cells using a standard transformation procedure (Maniatis 

10 et al., 1982), and the resulting clones were screened for 
p-galactosidase activity that will produce blue colonies 
on plates containing X-gal ( 5-bromo-4-chloro-3-indolyl-J3- 
D-galactoside) . The transformation mixture was plated on 
LB plates containing 200 ng/ml erythromycin, 1% glycerol 

15 and 100 ng/ml X-gal. 150 erythromycin resistant transfor- 
mants were obtained, all white initially, but after pro- 
longed incubation (two weeks at 4 °C), 46 of these colo- 
nies had become light blue. Thus, using strategy 1) we 
found 17 blue colonies (CP30 to CP46), and using strategy 

20 2) we found 29 blue colonies (CP1 to CP29). 

Plasmid DNA was isolated from each of these clones (CP1 
to CP46) and analysed by restriction enzyme mapping. 
Nearly all plasmids (except CP31 and CP43) contained pro- 
25 moter fragments inserted into the MCS of pAK80, in the 
orientation that would direct transcription of the other- 
wise promoterless lacLM genes on this vector. 

These 46 plasmid DNA preparations were then transformed 
30 into L. lactis subspecies lactis MG1363 with selection 
for erythromycin resistance. Cells of the L. lactis sub- 
species lactis strain, MG1363 (Gasson, 1983) were made 
competent by growing the cells overnight in SGM17 medium r 
containing 2% glycine, as described by Holo and Nes 
35 (1989). Plasmid DNA from each of the 46 clones described 
above was then transformed into these cells using the 
electroporation procedure (Holo and Nes, 1989). After re- 
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generation, the cells were plated on SR plates containing 
2 |ag/ml erythromycin. Subsequent screening for blue color 
on X-gal plates revealed large differences in p- 
galactosidase activity between the 46 clones; some clones 
5 gave dark blue colonies after 24 hours of incubation, 
others only light blue colonies after more than 1 week of 
incubation. 

The P-galactosidase activities of liquid cultures of the 
10 46 clones in MG1363 were then determined as described by 
Miller (1972) and modified by Israelsen et al. (1995). 
Cultures of the strain MG1363, each carrying one of the 
46 plasmid derivatives of pAK80 f were grown in M17 medium 
supplemented with 1% glucose overnight at 30 °C . 25-100 
15 nl of these cultures were then used in the subsequent P- 
galactosidase assay, except in the case of the weakest 
promoter clones, where 2 ml of culture was used (after 20 
fold concentration by centrif ugation) . These results are 
shown in Figure 1. Apparently, there are very large dif- 
20 ferences in the efficiency of the cloned promoter frag- 
ments, and together these clones cover a range of pro- 
moter activities from 0.3 units of P-galactosidase activ- 
ity to more than 2000 units, which is probably the 
strongest promoters known for this organism 

25 

In addition, the broad range is covered by small changes 
in activity and, therefore, these promoter fragments will 
allow us not only to obtain a wide range of expression of 
genes in L. lactis, but also to tune the expression of 
30 genes in L. lactis in small steps for the purpose of flux 
optimization, 

DNA sequencing of the 46 clones described above revealed 
that most of the inserted promoter fragments had the DNA 
35 sequence that was originally specified for the oligonu- 
cleotide design (see above), whereas the sequence of the 
remaining fragments deviated slightly from that sequence. 
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Most of the promoter fragments that gave the lower ac- 
tivities in the p-galactosidase assay (70 units of 0- 
galactosidase or less) had either an error in one of the 
consensus sequences or a 16 bp spacer between the consen- 
ts sus sequences. This result is in accordance with the 
dogma , i.e. that changes in the consensus sequences 
have strong effects on the activity of a given promoter, 
and emphasizes the fact that a more subtle approach is 
needed in order to generate a promoter library that cov- 

10 ers a range of activities in small steps of activity 
change. Clearly , if we would have allowed only changes in 
the consensus sequences and/or changes in the length of 
the spacer, instead of allowing the sequence of the spac- 
ers to vary randomly, only fairly weak promoter clones 

15 would have resulted, and the resulting library would not 
be suitable for metabolic engineering. 

In general, the clones that gave high activities (more 
than 70 units) had the same sequence as specified by the 

20 oligonucleotide. In total, the activity of the clones 
that had intact consensus sequences and 17 basepair 
spacer length, spanned activities from 5 units (CP4) to 
2000 units (CP25). These results therefore show that at- 
least a 400 fold change in promoter activity can be ob- 

25 tained by randomizing the spacer while the consensus se- 
quences are kept constant. 

Usually, for metabolic engineering purposes, relatively 
strong promoters are desired. However, there may also be 

30 cases where rather weak promoters are needed. The rela- 
tively few errors that had occurred during synthesis of 
the above oligonucleotide mixture, were not intended to 
be present in the promoter fragments; and our data then 
suggest that it may be useful to generate, deliberately, 

35 a mixture of oligonucleotides which have a low percentage 
of errors in the consensus sequences . 
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Enzymes used in the various enzymatic reactions above 
were obtained from and used as recommended by Pharmacia 
and Boehringer. 

5 EXAMPLE 2 

Design of a degenerated oligonucleotide for a library of 
temperature regulated L. lactis promoters. 

10 This example illustrates the development of a temperature 
regulated promoter library for L. lactis. A regulatory 
element comprising an eight basepair inverted repeat that 
has been shown to be involved in the heatshock response 
of L. lactis is inserted a few base pairs upstream of the 

15 -35 sequence. The minimal extent of such a regulatory 
element seems to be 27 basepairs: 

5 ' -TTAGCACTCNNNNNNNNNGAGTGCTAA- 3 ' 
IR spacer IR 

20 

containing a 9 bp (or longer) inverted repeat (IR) sepa- 
rated by 9 (or fewer) basepairs. It should therefore be 
possible to combine this inverted repeat with the ap- 
proach for obtaining constitutive promoters of different 
25 strength and thus obtain a series of promoters with vari- 
ous basal activities which can be induced several fold by 
changing the temperature of the culture medium. 

Therefore, an oligonucleotide was designed, which in- 
30 eludes the core part (from position -35 to +6) of the se- 
quence from the constitutive promoter series above (see 
example 1 and SEQ ID No. 1). The sequence upstream of the 
-35 hexamer has been replaced by the above inverted re- 
peat sequence, and the sequence downstream of position +6 
35 has also been modified, eliminating two conserved regions 
compared to example 1 (+1 to +8: GTACTGTT and -44 to -41: 
AGTT, which have been implicated in growth rate regula- 
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tion, but which turned out to be dispensable with respect 
to creating strong promoters for L. lactis, see example 
1). The sequence of the spacer (sp.l) between the two in- 
verted DNA sequences in the inverted repeat was here al- 
5 lowed to vary randomly in order to see whether this had 
any effect on the temperature regulation of the resulting 
promoters, e.g. how many fold they could be induced by 
changing the temperature. The importance of the spacing 
(sp.2) between the inverted repeat and the -35 hexamer is 

10 not known, but in principle this region may contribute to 
or modulate the heatshock response of promoters. In order 
to limit the number of parameters, however, we have cho- 
sen here to include a naturally occuring configuration 
(derived from the dnaJ promoter from L. lactis; van As- 

15 seldonk et ai., 1993): a short spacer sequence consisting 
of 5 times T. 

When these sequences are combined, one arrives at the 
following 7 3 bp consensus sequence for a temperature 
20 regulated promoter in L. lactis. Out of these 73 bp, 45 
are conserved, two are semi-conserved (R and W) and 26 
are allowed to vary randomly between the four nucleo- 
bases . 

25 5 ' TTAGCACTCNNNNNNNNNGAGTGCTAATTTT TTTGACAN NNNNNNNNNNNNNTgR 
IR spacer 1 IR sp.2 spacer 3 

X&TAAIANNWNAGTACTG 3* 

30 In addition, this degenerated sequence was flanked by se- 
quences that specify multiple recognition sites for re- 
striction endonucleases (multiple cloning sites MCS), and 
the actual oligonucleotide mixture that is being synthe- 
sized has the following degenerated sequence reported in 

35 SEQ ID No. 2: 
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Mbol 
Dpnl 
Alwl 
NlalV 

5 BstYI Msel Msel 

BamHI Alul Sspl 
Alwl Hindlll Asel 
1 ..... 
5 1 CGGGATCCAAGCTTAATATTAATTAGCACTCNNNNNNNNNGAGTGCTAATTTTTTTGACA 
10 IR IR -35 

Alul 
PVUII 
NspBII 

15 Sfcl EcoRI 

PstI Apol 
Rsal Fnu4HI Mael 
Seal Bbvl Xbal 
61 A . T . . .113 

20 NNNNNNNNNNNNNNTGG TATAATA NNANAGTACTGCAGCTGTCTAGAATTCGG 3 ' 

-15 -10 +1 



This oligonucleotide mixture was converted into double 
stranded DNA fragments (DSDF) and cloned into the pro- 

25 moter cloning vector pLB85i. pLB85i has an origin for 
replication in E. coll but not in L. lactis and se- 
lectable markers for both organisms. Instead of an origin 
of replication for L. lactis, it contains the attP se- 
quence which can direct the insertion of the plasmid into 

30 the L. lactis chromosome if the int gene product is sup- 
plied in trans. This example therefore also serves to il- 
lustrate the modulation of chromosomally encoded genes , 
using promoters generated through the random spacers 
method. pLB85i is also a promoter cloning vector and con- 

35 tains a multible cloning site upstream of a promoterless 
gusA gene. gusA is similar to the lacZ and lacLM screen- 
ing sytems used in the other examples, except that the 
substrate is X-gluc instead of X-gal. It was chosen as 
the reporter gene for this particular application, where 

40 heatshock regulated promoters are supposed to be ana- 
lysed. This was because the gusA gene product is not heat 
labile, which seems to be a problem connected with the 
lacLM gene used in example 1. 
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The DSDF mixture was here digested with Xbal and Asel and 
the vector pLB85i was digested with Xbal and Ndel (Ndel 
is compatible with Asel) and further treated with alka- 
line phosphatase to remove the 5 phosphate groups from 
5 the vector. After ligation of the DFDS mixture and the 
vector, the ligation mixture was transformed into strain 
KW1, a gusA negative E. coli strain, with selection for 
ampicillin resistance. This resulted in approximately 100 
colonies of which 80% had different blue color intensi- 

10 ties, indicating that these clones were carrying putative 
heatshock promoter fragments. 20 blue clones were picked 
for further analysis. Restriction analysis of plasmid 
preparations from these clones showed that they had in- 
serts of aproximately the right size. Subsequently, 6 of 

15 these plasmid preparations were used to transform LB436 
(a derivative of L. lactis MG1363 which contain a second 
plasmid, pLB81, supplying the necessary int gene for in- 
tegration of the plasmids into the attB site on the L. 
lactis chromosome). From each of the transformations 

20 colonies were isolated with putative integrations of the 
constructs into the attB attachment site on the L. lactis 
chromosome. The integration of the constructs was con- 
firmed by standard colony PCR, using one primer in the 
attB region and one primer in the pLB85i region. The 

25 clones were subsequently tested for their abillity to 
form blue colonies on GM17 medium supplemented with 
erythromycin and 100 |ig/ml X-gluc at 30 °C and 37 °C, re- 
spectively, and the clones showed a clear difference in 
color intensity at the two temperatures, indicating that 

30 the promoter activities were now under temperature regu- 
lation. Next the gusA expression from the heatshock pro- 
moters was measured in liquid cultures at the two tem- 
peratures, for 5 selected heatshock-regulated promoters, 
see Figure 2. The clones had different activities and 

35 covered a broad range of promoter activities at 30 °C, 
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and 4 out of the 5 clones (all except HP6) gave higher 
promoter activities at 37 °C, which shows that the pro- 
moters were indeed temperature regulated. Interestingly , 
the promoters were regulated by almost the same fold, 
5 i.e. from 1.7 to 2.3 fold by the temperature shift from 
30 to 37 °C, which indicates that the spacer (spacer 1) 
in the inverted repeat is of minor importance with re- 
spect to determining the fold of induction of these arti- 
ficial promoters. 

10 

We have here been looking at the accumulated gusA activ- 
ity, and although the data clearly show that the promot- 
ers are temperature regulated, one skilled in the art of 
analysing gene expression will appreciate that this makes 

15 it more difficult to observe the changes in promoter ac- 
tivities, that are brought about for instance by some 
change in external parameter such as temperature. In ad- 
dition, it has been shown that the activity of heatshock 
regulated promoters is temporarily ten fold higher imme- 

20 diately after the temperature shift, than it appears when 
the steady state levels are compared as they are on Fig- 
ure 2. To observe the change in promoter activity more 
carefully, one should look at the rate of protein or mRNA 
synthesis, before and at various times after the pertur- 

25 bation in temperature. We therefore isolated RNA from the 
5 clones at various times after the change in temperature 
from 30 °C to 37 °C and visualized the gusA mRNA by stan- 
dard Nothern blotting (Maniatis et ai., 1983), to analyze 
how many fold the individual promoter clone was induced 

30 by the temperature change. 

EXAMPLE 3 



The Gram-positive bacterium, Bacillus subtllis is used 
35 extensively as an industrial bioreactor for the produc- 
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tion of a range of heterologous proteins. It was there- 
fore of interest to test whether the random spacers 
method of the invention could also be used to generate 
promoter libraries for this organism. The consensus se- 
5 quences for Bacillus subtllls are very similar to the 
consensus sequences for E. coll and L. lactis, and we 
could therefore test whether the approach was also valid 
for Bacillus subtilis by subcloning a number of CP pro- 
moters into a promoter cloning vector for this Bacillus 

10 subtilis and then ask 1) whether the CP promoters are ac- 
tive in Bacillus subtilis and 2) whether, also in this 
organism, the spacer between the consensus sequences 
plays an important role for the promoter strength. We 
chose to use the promoter cloning vector, pDG268, which 

15 is designed for integration of promoter fusions to lacZ 
into the amy locus on the Bacillus subtilis chromosome. 
The vector confers ampicillin and neomycin resistance, 
and it will replicate in E. coli for the initial cloning 
purposes, but not in Bacillus subtilis. When the line- 

20 arized form of the vector is transformed into Bacillus 
subtilis it will be inserted into the amy locus on the 
Bacillus subtilis chromosome. This example therefore also 
serves as an example of the use of the promoters gener- 
ated through the random spacers methods for modulation of 

25 chromosomally encoded genes. 

The CP promoters were indeed active also in Bacillus sub- 
tilis, and the individual strength of the promoters were 
again very different from each other. The fact that the 
30 promoter library also covered a broad range of promoter 
activities in Bacillus subtilis, shows that the random 
spacers method is also valid for this organism. 
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EXAMPLE 4 

The CP promoters of example 1 were designed for use in L. 
lactis, but the consensus sequences of E. coli promoters 
5 are included in the sequence of the oligonucleotide given 
in example 1, Furthermore, the vector used for generating 
the CP promoter library in example 1 is a shuttle vector 
for L. lactis and E. coli, so this also allowed us to 
analyse the activity of the CP promoters in the Gram- 

10 negative bacterial host, E. coli. Figure 3 shows the ac- 
tivity of 33 of the CP promoters in E. coli. Clearly, the 
activity of the individual CP promoters is also here very 
different, and together the promoters cover a broad range 
of activities. Interestingly, the correlation between the 

15 strength of the individual promoters in E. coli and L. 
lactis was not very strong: some promoters that were 
found to be strong promoters in L. lactis were found to 
be weak in E. coli and vice versa. 

20 The activity of the promoters, in terms of 0- 
galactosidase activity, was generally much lower than the 
activity found in L. lactis. This was probably due to the 
fact that the promoter cloning vector, pAK80 is designed 
and optimised for use in the Gram-positive bacterium, L. 

25 lactis, and the translation efficiency in E. coli could 
therefore be low. We therefore sub-cloned three of the CP 
promoters (CP20, CP22 and CP25) into the promoter cloning 
vector, pCB267, which was designed for cloning E. coli 
promoters in upstream of a promoterless lacZ gene, encod- 

30 ing P-galactosidase . In this promoter system, the CP pro- 
moters turned out to be very efficient promoters, but the 
relative difference in strength between the three promot- 
ers was conserved. Thus, the CP25 promoter gave 2.5 fold 
higher activity than the hybrid promoter, ptac, which is 

35 considered to be among the strongest promoters known for 



WO 98/07846 



PCT/DK97/00342 



30 

use in E. coli. These data therefore further demonstrate 
the strength of our approach: by analysing a relatively 
small number of promoter clones obtained through the ran- 
dom spacers method of the invent ion , we have managed to 
5 arrive at some of the strongest promoters in both E. coli 
and L. lactis. 

EXAMPLE 5 

10 Bacteria belonging to the Gram-negative species, Pseudo- 
monas, are becoming increasingly important due to their 
application in e.g. biodegradation of chemical waste 
products in polluted soil. But also here the genetic en- 
gineering is hampered by the lack of suitable promoters 

15 and expression systems. The literature on Pseudomonas 
promoters revealed that the consensus sequences for Pseu- 
domonas are somewhat less well defined than those of E. 
coli, L. lactis and Bacillus subtilis. Thus, in the -35 
region one often finds TTGR conserved (R=A or G) whereas 

20 the rest of the -35 consensus sequence is varying between 
different promoters. The -10 consensus sequence is proba- 
bly TATRAT. The spacing between the TTGR motive and the 
-10 sequence is 18-19 bp, which is equivalent of the 16- 
17 bp spacer often found in E. coli. It follows that the 

25 consensus sequences for vegetative promoters in this or- 
ganism are quite close to the consensus sequences for E. 
coli and L. lactis. 

We therefore tested a range of the CP promoters from ex- 
30 ample 1 in Pseudomonas putida by cloning the promoters 
into a cloning vector that contains a promoterless P- 
galactosidase gene. Again the activity of the CP promot- 
ers differed in strength over a broad range of promoter 
activities. These results show that also in this species, 
35 the random spacers method of the invention could be used 
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to generate both relatively strong promoters and also a 
broad range of promoter activities covered in small steps 
of activity change 

5 EXAMPLE 6 

We also designed an oligonucleotide based on the consen- 
sus sequences stated in example 5 and incorporated multi- 
ple cloning sites as described in example 1. The follow- 
10 ing oligonucleotide was synthesised, 

MCS-(N)g-TTGR-Ni9-TATRAT-(N)8-MCS 

The oligonucleotide was converted to double stranded DNA, 
15 using a primer homologous to the 3' end of the oligonu- 
cleotide and cloned upstream of a promoterless 0- 
galactosidase gene on a promoter probe vector. The liga- 
tion mixture was transformed directly into a Pseudomonas 
putida strain with selection for the antibiotics resis- 
20 tance carried by the plasmid and on plates containing X- 
gal . This resulted in approximately 100 clones of various 
blue color intensities. Subsequently, 30 clones were ana- 
lysed for P-galactosidase activity as described above. 
These results showed that also in this species, the ran- 
25 dom spacers method of the invention could be used to gen- 
erate both relatively strong promoters and also a broad 
range of promoter activities covered in small steps of 
activity change. 

30 EXAMPLE 7 

Design of a degenerated oligonucleotide for a Saccharomy- 
ces cerevisiae promoter library. 

35 A 199 bp oligonucleotide was designed, which includes, 
starting from the 5' end: an EcoRI restriction site (for 
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use in the subsequent cloning strategy, see below) , the 
consensus UASQCN4pf a 59 bp spacer (spacer 1), the con- 
sensus TATAAA sequence (TATA box), a 38 bp spacer (spacer 
2 ) , the 23 bp repressor binding sequence (which is also 
5 the region in which transcription is initiated), the 
spacer sequence between the TI box and the first codon of 
the ARG8 gene (this spacer is here the native sequence 
from the ARG8 gene including also the first codon of the 
ARG8 gene, ATG) , and, finally, a BamHI site was included, 
10 which is designed to give an in frame fusion with the p- 
galactosidase reporter gene, see below. 

The sequences of spacer 1 and spacer 2 were allowed to 
vary randomly, and the actual oligonucleotide mixture be- 
15 ing synthesized had the following degenerated sequence 
reported in SEQ ID No. 3: 

Plel 
EcoRI Hinfl 
20 Apol Maelll 

1 . ... 

5 » CAGAATTCGTGACTCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
UAS fl 



25 61 

NNNNNNNNNNNNNNNNTATAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
TATA-box 

Msel Tfil 
30 Aflll Maelll Hinfl 

121 

N CTCTTAAGTQCAAQTQAC TGCGAA CATTTTTTTCGT T T GTTAGAATAAT T C AAGAATC G 
TI box 

35 Mbol 
Dpnl 
Alwl 
NlalV 
BstYI 

40 BamHI 
Alwl 
Nlalll 
181 . 199 

CTACCAATCATGGATCCCG 3* 
45 ARG8 ct «xt 



This oligonucleotide mixture was converted into double 
stranded DNA fragments (DSDF), using an oligonucleotide 
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complementary to the last 23 bp of the 3' end of the 199 
bp degenerated oligonucleotide as described in example 1. 
Subsequently, it was cloned into either of the two pro- 
moter cloning vectors, pYLZ-2 and pYLZ-6 (Hermann et ai., 
5 1992), as follows: the DSDF mixture and the vector were 
both double-digested with EcoRI and BamHI f and the DSDF 
were ligated to the vector DNA. The ligation mixture was 
transformed into E. coli as described in example 1, with 
selection for ampicillin resistance, Plasraid DNA was iso- 
10 lated from 500 individual clones and screened for the 
presence of putative promoter fragments by digestion with 
the restriction enzymes, EcoRI and BamHI. 

17 clones were found to have an EcoRI - BamHI insert of 
15 approximately 200 bp. Plasmid DNA from these 17 clones 
were transformed into 5. cerevisiae, with selection for 
the URA3 marker carried by the plasmids and assayed for 
production of P-galactosidase. Figure 4 shows the result- 
ing activities of 0-galactosidase for the 17 clones. All 
20 the promoters have higher activities than the cloning 
vector itself without promoter fragment inserted (pYLZ- 
2). More important however, also in this case the clones 
cover a range of promoter activities in small steps of 
activity change. 

25 

Sequence analysis revealed that the 17 clones discussed 
above had a perfect TATA box (TATAAA) between the 
spacer 1 and spacer 2, while two of the 17 clones, YP24 
and YP435 each had a defect in the UASGCN4p- However, the 
30 activity of YP435 was 39 units which is close to the 
value obtained with YP212. These data then suggest that 
the impact of the random sequence of the spacers on the 
promoter strength, is stronger than the impact of the 
state of the UASgcN4p binding site. 
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As discussed above, the artificial yeast promoters had 
built in two different regulatory features. One is that 
the promoters should be regulated by the presence of ar- 
ginine in the growth medium. To test whether the art if i- 
5 cial yeast promoters were also regulated by arginine, we 
grew a number of clones in minimal medium, with and with- 
out arginine (SD + 2% glucose; SD + 2% glucose + 1 g/1 
arginine). Figure 5 shows the result of these experi- 
ments. The clones, YP18, YP212, YP435 were indeed regu- 
10 lated 5, 8, 15 fold, respectively, by the presence of ar- 
ginine. YP183 was not regulated by arginine, and sequence 
analysis confirmed that this promoter clone had a 13 bp 
deletion in the arginine repressor binding site. 

15 We also tested whether the promoters were regulated by 
external amino acids in the growth medium, by analyzing 
the promoter activity of some of the yeast promoter 
clones in minimal medium (SD + 2% glucose) and rich me- 
dium (SC + 2% glucose-URA) . Figure 6 shows the result of 

20 these experiments. Indeed, the promoters present in 
clones YP18,YP212, YP435, and YP183 were regulated from 2 
to 10 fold, by the presence of amino acids. YP24 was not 
regulated, in accordance with the error that had occured 
in the UASQCN4p site on this clone, see above. 

25 

The results on amino acid and specific arginine regula- 
tion demonstrate that the random spacers method of the 
invention can be used for generating promoters which 
cover a broad range of promoter activities and which can 
30 be regulated by external signals. The regulatory aspect 
of the invention is here examplified by the amino acid 
and arginine regulation, but is not limited to these 
cases . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Peter Ruhdal Jensen 

(B) STREET: Soegaardsvej 19 

(C) CITY: Gentofte 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2820 

(ii) TITLE OF INVENTION: Artificial promoter libraries for selected 
organisms and promoters derived from such libraries 

(iii) NUMBER OF SEQUENCES: 58 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 8 86/96 

(B) FILING DATE: 23-AUG-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION: 26. .82 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Artificial promoter library" 
/note= "A degenerated sequence specifying a mixture of 
artificial promoters covering a wide range of expression 
in small steps in L. lactis" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION: 31. .45 

(D) OTHER INFORMATION : /standard__name~ "Consensus 
sequence" 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 
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(B) LOCATION: 60. .69 

(D) OTHER INFORMATION : / standard_name= "Consensus 
sequence" 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 74. .82 

(D) OTHER INFORMATION: / standard_name- "Consensus 
sequence" 

(ix) FEATURE: 

(A) NAME/ KEY: -35_signal 

(B) LOCATION: 40. .45 

(D) OTHER INFORMATION : /standard__name= "-35 box" 

(ix) FEATURE: 

(A) NAME/ KEY: -10_signal 

(B) LOCATION: 63. .68 

(D) OTHER INFORMATION :/standard_name= "Pribnow box" 

(ix) FEATURE: 

(A) NAME /KEY: misc_recomb 

(B) LOCATION: 3. .25 

(C) IDENTIFICATION METHOD: experimental 

(D> OTHER INFORMATION: /evidence= EXPERIMENTAL 
/standard_name= "Multiple cloning site" 
/label= MCS 

/note= "A sequence specifying recognition sites for the 
restriction endonucleases : NlalV, BstYI, BamHI, Alwl, 
Mbol, Dpnl, Aflll, Msel, Sspl, Nsil." 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 74 . . 98 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence^ EXPERIMENTAL 

/standard_name= "Multiple cloning site" 
/label= MCS 

/note= "A sequence specifying recognition sites for the 
restriction endonucleases: Scal r Rsal, Hpal, Hindi, Msel, 
Sfcl, PstI, Fnu4HI, Bbvl , PvuII, NspBII, Alul, EcoRI , 11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CGGGATCCTT AAGAATATTA TGCATNNNNN AGTTTATTCT TGACANNNNN NNNNNNNNNT 60 

GGTATAATAN NANAGTACTG TTAACTGCAG CTGAATTCGG 100 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
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(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION: 23. .95 

(D) OTHER INFORMATION: /standard_name= "Artificial 
promoter library" 

/note= "A degenerated sequence specifying a mixture of 
artificial temperature regulated promoters covering a wide 
range of expression in small steps in L. lactis" 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION : 2 3 . .49 

(D) OTHER INFORMATION: /standard_name= "Sequence providing 
temperature regulation to promoters" 

/note= "This sequence comprising two inverted repeats 
separated by a short spacer provides temperature (heat 
shock) regulation to promoters in Gram-positive bacteria" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION: 50. .60 

(D) OTHER INFORMATION : /standard_name= "Consensus 
sequence" 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 75. .84 

(D) OTHER INFORMATION :/standard_name= "Consensus 
sequence" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 89. .95 

(D) OTHER INFORMATION : / standard_name= "Consensus * 
sequence" 

(ix) FEATURE: 

(A) NAME/KEY: -35_signal 

(B) LOCATION: 55. .60 

(D) OTHER INFORMATION :/standard_name= "-35 box" 

(ix) FEATURE: 

(A) NAME/KEY: -10_signal 

(B) LOCATION: 78. .83 

(D) OTHER INFORMATION:/ st andard_name= "Pribnow box" 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATIONS. .22 

(D) OTHER INFORMATION: /standard_name= "Multiple cloning 
site" 

/label= MCS 

/note= "A sequence specifying recognition sites for the 
restriction endonucleases : NlalV, BstYI , BamHI, Alwl, 
Mbol, Dpnl, HindlH, Alul, Msel (2 sites), Sspl, Asel . " 

(ix) FEATURE: 

(A) NAME / KEY : misc_recomb 

(B) LOCATION: 89. .111 

(D) OTHER INFORMATION : / s tandard_riame= "Multiple cloning 
site" 

/labels MCS 

/note= "A sequence specifying recognition sites for the 
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restriction endonucleases : Seal, Rsal, Sfcl, PstI, Fnu4HI, 
Bbvl, PvuII, NspBII, Alul, Xbal, Mael, EcoRI, Apol . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGGGATCCAA GCTTAATATT AATTAGCACT CNNNNNNNNN GAGTGCTAAT TTTTTTGACA 60 

NNNNNNNNNN NNNNTGGTAT AATANNANAG TACTGCAGCT GTCTAGAATT CGG 113 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: protein_bind 

(B) LOCATION: 10. .16 

(D) OTHER INFORMATION :/function= "Activating promoters in 
S. cerevisiae" 

/bound_moiety= "GCN4 protein" 

/standard name= "Upstream activating sequence" 
/label= UAS_GCN4p 

/note= "A DNA sequence that specifies a binding site for 
the GCN4 protein, which activates the transcription of 
genes involved in aminoacid synthesis in S. cerevisiae." 

(ix) FEATURE: 

(A) NAME/KEY: TATA_signal 

(B) LOCATION: 67. .72 

(D) OTHER INFORMATION: /s tandard_name= "TATA box" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_signal 

(B) LOCATION: 122. .144 

(D) OTHER INFORMATION :/function= "Transcription 
initiation" 

/standard_name= "Tl box" 

(ix) FEATURE: 

(A) NAME/KEY: protein_bind 

(B) LOCATION: 122. .144 

(D) OTHER INFORMATION : /bound_moiety= "Arginine repressor" 
/standard_name= "arginine repressor binding site" 
/label= argR 

(ix) FEATURE: 

(A) NAME/ KEY: misc_RNA 

(B) LOCATION: 145. . 192 

(D) OTHER INFORMATION :/function= "Spacer" 

/standard_name= "Part of native sequence for ARG8 
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gene incl . first codon" 

(ix) FEATURE: 

(A) NAME/KEY: misc__recomb 

(B) LOCATION: 3 . . 8 

(D) OTHER INFORMATION : /standard_name= "Recognition site 
for restriction endonuclease EcoRI" 
/label= EcoRI_site 

(ix) FEATURE: 

(A) NAME/ KEY: misc_recomb 

(B) LOCATION: 192. .197 

(D) OTHER INFORMATION :/standard_name= "Recognition site 
for restriction endonuclease BamHI" 
/label= BamHI_site 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION: 10. .192 

(D) OTHER INFORMATION : / s tanda rd_name= "Artificial 
promoter library" 

/note= "A degenerated sequence specifying a mixture of 
artificial promoters covering a wide range of expression 
in small steps in S. cerevisiae" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CAGAATTCGT GACTCANNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 60 

NNNNNNNNNN NNNNNNTATA AANNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 120 

NCTCTTAAGT GCAAGTGACT GCGAACATTT TTTTCGTTTG TTAGAATAAT TCAAGAATCG 180 

CTACCAATCA TGGATCCCG 199 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATION : 1 . .45 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Artificial promoter library" 
/note- "A degenerated sequence specifying a mixture of 
artificial promoters covering a wide range of expression 
in small steps in Pseudomonas putida" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
NNNNNNNNTT GRNNNNNNNN NNNNNNNNNN NTATRATNNN NNNNN 4 5 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/labels Cpl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CATACCGGAG TTTATTCTTG ACAGTTCCAC CTCGGGTTGA TATAATATCT CAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= CplO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CATGGCTTAG TTTATTCTTG ACAGGGTAGT ATCACTGTGA TATAATAGGA CAGTACTGTT 



60 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 59 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .59 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cpll 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CATAAGTGAG TTTATTCTTG ACCCGGACGC CCCCCTTTGA TATAATAAGT AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO - 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/label= Cpl2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CATATACAAG TTTATTCTTG ACACTAGTCG GCCAAAATGA TATAATACCT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . . 60 

(D) OTHER INFORMATION : /s tandard_name= "Constitutional 
promoter" 
/label= Cpl3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CATGCTTTAC TTTATTCTTG ACAAAACCAC CAGCTTTTGG TATAATACGT GAGAACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .60 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/label= Cpl4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATGACGGAG TTTATTCTTG ACACAGGTAT GGACTTATGA TATAATAAAA CAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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{iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

( B) LOCATION : 4 . . 60 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cpl5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CATTACNTAG TTNATTCTTG ACAGAATTAC GATTCGCTGG TATAATATAT CAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL; YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION : 4 . . 58 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cpl6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CATTGTGTAG TTTATTCTTG ACAGCTATGA GTCAATTTGG TATAATAACA GTACTCAG 58 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . . 59 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/label= Cpl7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CATTCTGGAG TTTATTCTTG ACCGCTCAGT ATGCAGTGGT ATAATAGTAC AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .58 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cpl8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CATTTTGCAG TTTATTCTTG ACATTGTGTG CTTCGGGTGT ATAATACTAA GTACTGTT 58 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATIONS. .58 
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(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cpl9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CATCGCTTAG TTTTTCTTGA CAGGAGGGAT CCGGGTTGAT ATAATAGTTA GTACTGTT 58 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/label- Cp2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CATTTGCTAG TTTATTCTTG ACATGAAGCG TGCCTAATGG TAT ATT AC TT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp20 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CATGGGTGAG TTTATTCTTG ACAGTGCGGC CNGGGGCTGA TATCATAGCA GAGTACTATT 60 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .59 

(D) OTHER INFORMATION : /s tandard_name= "Constitutional 
promoter" 
/label= Cp21 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CATTACCGAG TTTATTCTTG ACACCGTTTA TCGGGGTTGT ATAATACTAT AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . . 60 

(D) OTHER INFORMATION :/standard_name= "Constitutional 
promoter" 
/label= Cp23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CATGTAGGAG TTTATTCTTG ACAGATTAGT TAGGGGGTGG TATAATATCT CAGTACTGTT 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION :/standard_name= "Constitutional 
promoter" 
/label= Cp24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CATGGGTAAG TTTATTCTTC ACACTATCTG GGCCCGATGG TATAATAAGT GACTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION : 3 . .59 

(D) OTHER INFORMATION: / standard _name= "Constitutional 
promoter" 
/label= Cp25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTTTGGCAGT TTATTCTTGA CATGTAGTGA GGGGGCTGGT ATAATCACAT AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cp26 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CATTCTACAG TTTATTCTTG ACATTGCACT GTCCCCCTGG TATAATAACT ATACATGCAT 60 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid j 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /standard_name= "Constitutional 
promoter" 
/label= Cp28 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CATGGGGCCG TTTATTCTTG ACAACGGCGA GCAGACCTGG TATAATAATA TAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .59 

(D) OTHER INFORMATION : /s tandardjiame- "Constitutional 
promoter" 
/label* Cp29 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CATCGGTAAG TTATTCTTGA CATCTCAGGG GGGACGTGGT ATAATAACTG AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION : / standard_name= "Constitutional 
promoter" 
/label= Cp3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CATCCTGTAG TTTATTCTTG ACACACGTNN TTAGCTGTGG TATAATAGGA GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 4. .60 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cp30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CATGACAGAG TTTATTCTTG ACAGTATTGG GTTACTTTGG TATAATAGTT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

Uv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION : / s tandard_name= "Constitutional 

promoter" 
/label= Cp32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CATACGGGAG TTTATTCTTG ACATATTGCC GGTGTGTTGG TATAATAACT TAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .60 
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(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" ) 
/labels Cp33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CATGTTGGAG TTTATTCTTG ACATACAATT ACTGCAGTGA TATAATAGGT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ( genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 
<B) LOCATIONS. .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp34 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CATCGCGAAG TTTATTCTTC ACACACCGCA GAACTTGTGG TATAATACAA CAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION : 4 . .59 

(D) OTHER INFORMATION: /standard_name= "Constitutional 

promoter" 
/label= Cp37 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CATCATTAAG TTTATTCTTC ACATTGGCCG GAATTGTTGT ATAATACCTT AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: / standard jname= "Constitutional 

promoter" 
/label= Cp38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CATAGAGAAG TTTATTCTTG ACAGCTAACT TGGCCTTTGA TATAATACAT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION : 4 . .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CATTGCGAAG TTTATTCTTG ACAGTACGTT TTTACCATGA TATAATAGTA TAGTACTGTT 60 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATION: 4. .60 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/label= Cp4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GATGTTTTAG TTTATTCTTG ACACCGTATC GTGCGCGTGA TATAATCGGG ATCCTTAAGA 60 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 
<B) LOCATIONS 

(D) OTHER INFORMATION : / standard_name= "Constitutional 
promoter" 
/label= Cp40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CATAGAACAG TTTATTCTTG ACATTGAATA AGAAGGCTGA TATAATAGCC AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION: /s tandard_name= "Constitutional 
promoter" 
/labels Cp41 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CATCCGCAAG TTTATTCTTG ACAGCTGAAT GTAGACGTGG TATAATAGTT AAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION : / s tanda r d_name= "Constitutional 
promoter" 
/label= Cp42 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CATTCGTAAG TTTATTCTTG ACACCTGAGA TGAGGCGTGA TATAATAAAT AAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION : 4 . .59 

(D) OTHER INFORMATION :/standard_name= "Constitutional 
promoter' 1 
/label= Cp4 4 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CATCGGGTAG TTTATTCTTG ACAATTAAGT AGAGCCTGAT ATAATAGTTC AGTACTGTT 59 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE; 

(A) NAME/ KEY: promoter 

(B) LOCATION: 4. .59 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Constitutional promoter" 
/label= Cp5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CATGGGGGAG TTTATTCTTG ACATCATCTT CGTAGCCTGG TAT ACT AC AT GAGTATGTT 59 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 



(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE : 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CATGTGGGAG TTTATTCTTG ACACAGATAT TTCCGGATGA TATAATAACT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATIONS. .60 

<D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TATGCGGTAG TTTATTCTTG ACATGACGAG ACAGGTGTGG TATAATGGGT CTAGATTAGG 60 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 
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(B) LOCATION : 4 . . 60 

(D) OTHER INFORMATION : / s tandard_name= "Constitutional 
promoter" 
/label= Cp8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CATTCTTTAG TTTATTCTTG ACAAACGTAT TGAGGACTGA TATAATAGGT GAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 60 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Lactococcus lactis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 4. .60 

(D) OTHER INFORMATION : /standard_name= "Constitutional 
promoter" 
/label= Cp9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CATAGTCTAG TTTATTCTTG ACACGCGGTC CATTGGCTGG TATAATAATT TAGTACTGTT 60 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATION: 8. .177 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl02 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GAATTCGTGA CTCAAACGGG TGGTCGACGG GTGGTTCCAA TTAATTGGCG TCCCTCTTAT 60 

AAAGGCGAGG GTACGTGCGA CAATTGGTAG AGCGAGCGGG GCTCTTAAGT GCAAGTGACT 120 

GCGAACATTT TTTTCGTTTG TTAGAATAAT TCAAGAATCG CTACCAATCA TGGATCC 177 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATIONS. .181 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypll2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAATTCGTGA CTCACGGCAT CTGATGGTTG ACCATAGTCA GGAACATTGT GCTGGAGTTC 60 

CTTGAGGAAT GAGTTATAAA ATGGGAGGTT GCGGCTAATG CCAGGCAGGA GAGGAACCCT 120 

CTTAAGTGCA AGTGACTGCA AACATTTTTT TCGTTTGTTG AATCGCTACC AATCATGGAT 18 0 

CC , 182 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 
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(A) NAME/ KEY: promoter 

( B) LOCATION: 8. .181 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GAATTCGTGA CTCACTAGGC AGGTCACGTT GGCTCTTCGC GGCGCAGGTT CGTATGCCGC 60 

GCCGCCAGGG GCTTTATAAA GGTCGTCCTG GGTACAGTTG GGATGGCTCC ACGTTTCGGC 120 

TCTTAAGTGC AAGTGACTGC GAACATTTCG TTTGTTAGAA TAATTCAAGA ATCGCTACCA 180 

ATCATGGATC C 191 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .167 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GAATTCGTGA CTCAGGGCCG TACTAAGTAG CTTTCGTATG CTATGCGGGG TTTTATAAAT 60 

CTTTGGGCCA TGGTCTTGCT GGAAAACACC TCTCTTAAGT GCAAGTGACT GCGAACATTT 120 

TTTTCGTTTG TTAGAATAAT TCAAGAATCG CTACCAATCA TGGATCC 167 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION : 8 . .191 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl54 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GAATTCGTGA CTCACCGCTC GGGTGCAGGG CCAAGGCGGC GGAATGTGCG GGGCGTTCTA 60 

GCGCAATCGG GGTATAAATT TATAAGGAGG CTGCGGGTGC TAGTTTGTCT AGTTTGACTC 120 

TTAAGTGCAA GTGACTGCGA ACATTTTTCG TTTGTTAGAA TAATTCAAGA ATCGCTACCA 18 0 

ATCATGGATC C 191 

(2) INFORMATION FOR SEQ ID NO: 4 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae • 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 8. .190 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GAATTCGTGA CTCAGGATTA GCTATGCCGG TTGGGATAAG CGAACAACTG GAGGTGAGAA 60 

GCTTTTTGTC AGAATATAAA CCCGTTAGTC AGGGTTTGGT GGGATAGGGG GTACTGTACC 120 

TCTTAAGTGC AAGTGACTGC GAACATTTTT TTCGTTTGTT AGAATAATTC AAGAAT CGCT 180 

ACCAATCATG GATCC 195 

(2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 
{ B) LOCATIONS. .17 9 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Ypl63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: AS: 

GAATTCGTGA CTCACTAAGG GTTCGCCATT AACAGAATCG CTGGTAGAAC ATCGGTAGTT 60 

AGGCACCCGA GTATAAACAG GCGGACCCCT CACGGATATC AGCTGATAGT GCGAGCCTCA 120 

ATGCGAACAT TTTTTTCGTT TGTTAGAATA ATTCAAGAAT CGCTACCAAT CATGGATCC 17 9 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME /KEY: promoter 

(B) LOCATIONS. .190 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence^ EXPERIMENTAL 

/standard name= "Yest promoter" 
/labels Ypl90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GAATTCGTGA CTCAGTATCC ACGGGTGTTT GAGGGCTGGT CGCAGGTTAG CAGGCGAGGG 60 

CGGGTGGTTA CGGCTATAAA TGAGTGTTTG CAGCCGGGTA CGGGCGTACG AGTAGTGATC 120 
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TCTTAAATGC AAGTGACTGC GAACATTTTT TTCGTTTGTT AGAATAATTC AAGAATCGCT 18 0 

ACCAATCATG GATCC 195 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .189 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name~ "Yest promoter" 
/label= Ypl91 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GAATTCGTGA CTCAATGCTG CGGGCGGCAG GAGTCTGGTG TAACTTCCCA TTTTGAGTGA 60 

AAGACAGACC ATCTATAAAC ATTTGGTGGG CAAAGTGGCC TGGCGGATTT GTTTGGACTC 120 

TTAAGTGAAA GTGACTGCGA ACATTTTTTT CGTTTGTTAG AATAATTCAA GAATCGCTAC 180 

CAATCATGGA TCC 193 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATION: 8 . . 166 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 
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/standard_name= "Yeast promoter" 
/label= Ypl92 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

GAATTCGTGA CTCACTTAAG GCTACTGCGG AAGTTTAGAT CTAAGGTCGG AAATAATTTA 

GAAAATTACG AC AT TAT AAA TAGCGGAGAG GCCAGGTGAT GGGCACCATT GTGGGGGGGC 

TCTTAATTGT TAGAATAATT CAAGAATCGC TACCAATCAT GGATCC 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

<ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION: 8. .190 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence== EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Yp212 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GAATTCGTGA CTCAGTCGCC CGCAAGATGG GATGGTGCAT TTTAAACACC CGAATTATAC 60 

TCGTCAACTT ATAGTATAAA CGGAACGCGA CGATACGTTC TAGTTTTCGG CGAAGTCGAC 12 0 

TCTTAAGTGC AAGTGACTGC GAACATTTTT TTCGTTTGTT AGAATAATTC AAGAATCGCT 180 

ACCAATCATG GATCC 195 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 
120 
166 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 



WO 98/07846 



PCT/DK97/00342 



67 



(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION: 8. .183 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Yp24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

GAATTCGTAC TCACGACAGC GTTATGACTT CGAGGACCAG CTACTTCCGG TCGCGTACTA 60 

GTTTTTACCT GTATAAACTT TGCTACCGCT GGGCCTTGGT GGTGCTGTCC CGCTCTTAAG 12 0 

TGCAAGTGAC TGCGAACATT TTTTTCGTTT GTTACAATAA TTCAAGAATC GCTACCAATC 180 

ATGGATCC 188 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES - 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATIONS. .190 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : / evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Yp435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GAATTCGTGA CTAAATGGAT AAGGTTATCG CCATCACGGA GTCTTCTCTC ACGTCTGGAG 60 

CAGAGGCTAG ACCTTATAAA TTATACATGG TGGGAGAGGC GATAGTCTTT AGAGACGTGC 120 

TCTTAAGTGC AAGTGACTGC GAACATTTTT TTCGTTTGTT AGAATAATTC AAGAATCGCT 180 

ACCAAT CAT G GATCC 195 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. .184 . 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Yp68 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GAATTCGTGA CTCACAAGAA TGTGGGCGGG TCGTTAAACT GAGCCTGGAC ACCTTGGCTG 60 

CGTCGCTTTC GTATAAAGAT CTTAGAGCTG TGGAGTCTGG GTCGAGTGGC CAGCTCTTAA 120 

ATGCAAGTGA CTGCGAACAT TTTTTTCGTT TGTTAGAATA ATT C AAGAAT CGCTACCAAT 180 

CATGGATCC 18 9 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY: promoter 

(B) LOCATIONS. ,190 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/labels Yp89 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57; 

GAATTCGTGA CTCACTCGGA AGATTGGGTT TACGATTAGG ATGGCGCGGC AGAACCGGGG 60 

GGGATTCCCT TCTATATAAA GGGTTCCGAT ACTACGTGCT GCGGACGGCC GATCGAGTTA 120 



TCTTAAGTGC AAGTGACTGC GAAAATTTTT TTCGTTTGTT AGAATAATTC AAGAATCGCT 180 
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ACCAATCATG GATCC 195 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE: 

(A) NAME/ KEY : promoter 

(B) LOCATION : 8 . . 171 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION :/evidence= EXPERIMENTAL 

/standard_name= "Yeast promoter" 
/label= Yp96 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GAATTCGTGA CTCATCTAGT GAGAGGAGCC GTGGTATCTT GTGTCACCAC CAGGGGAAAA 60 

TAATGGCAGG GGTGTATAAA TGGTCGAGTA GTCGCGACCC ACGCTGCAAG GCAAGGAACT 120 

CTTAAATTTT TTTCGTTTGT TAGAATAATT CAAGAATCGC TACCAATCAT GGATCC 176 
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PATENT CLAIMS 

1. An artificial promoter library for a selected organ- 
ism or group of organisms, comprising a mixture of double 

5 stranded DNA fragments the sense strands of which com- 
prise at least two consensus sequences of efficient pro- 
moters from said organism or group of organisms, or parts 
thereof comprising at least half of each, and surrounding 
or intermediate nucleotide sequences (spacers) of vari- 
10 able length in which at least 7 nucleotides are selected 
randomly among the nucleobases A, T f C and G, with the 
proviso that previously known promoter sequences and pro- 
moter sequences isolated from natural sources are not 
comprised. 

15 

2. An artificial promoter library according to claim 1, 
wherein at least 10, preferably at least 12, and more 
preferably at least 14, nucleotides in the spacer se- 
quence^) are selected randomly among the nucleobases A, 

20 T, C and G. 

3. An artificial promoter library according to claim 1 
or 2, wherein the sense strands of the double stranded 
DNA fragments include a regulatory DNA sequence imparting 

25 a specific regulatory feature to the promoters of the li- 
brary . 

4. An artificial promoter library according to claim 3, 
wherein the specific regulatory feature is activation by 

30 a change in the growth conditions. 

5. An artificial promoter library according to any one 
of claims 1-4, wherein the double stranded DNA fragments 
have a sequence comprising one or more recognition sites 

35 for restriction endonucleases added to one of or both 
their ends . 
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6. An artificial promoter library according to claim 5, 
wherein the double stranded DNA fragments have a sequence 
specifying multiple recognition sites for restriction en- 
donucleases (multiple cloning sites MCS) added to one of 
or both their ends . 

7. An artificial promoter library according to any one 
of claims 1-6 , wherein the selected organism or group of 
organisms is selected from the group consisting of pro- 
karyotes . 

8. An artificial promoter library according to claim 7, 
wherein the prokaryotic microorganism is selected from 
the group consisting of bacteria belonging to the genera 
Lactococcus , Streptococcus , Enterococcus , Lac tobacil 1 us , 
Leuconostoc, Escherichia, Bacillus and Pseudomonas. 

9. An artificial promoter library according to claim 8, 
wherein the prokaryotic microorganism is selected from 
the group consisting of Lactococcus lactis, Streptococcus 
thermophilus, Escherichia coli, Bacillus subtilis and 
Pseudomonas putida. 

10. An artificial promoter library according to any one 
of claims 7-9, wherein said consensus sequences comprise 
the -10 signal (-12 to -7): TATAAT and at least one acti- 
vator protein binding site upstream of the -10 signal or 
parts thereof comprising at least 3 conserved nucleotides 
of each. 

11. An artificial promoter library according to any one 
of claims 7-9, wherein said consensus sequences comprise 
the -35 signal (-35 to -30): TTGACA and the -10 signal 
(-12 to -7): TATAAT or parts of both comprising at least 
3 conserved nucleotides of each. 
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12 • An artificial promoter library according to claim 11, 
wherein said consensus sequences comprise from 4 to 6 
conserved nucleotides of the -35 signal and from 4 to 6 
conserved nucleotides of the -10 signal, preferably 5 or 
5 6, and more preferably all 6 nucleotides of each. 

13. An artificial promoter library according to any one 
of claims 10-12, wherein said consensus sequences further 
comprise intervening conserved motifs. 

10 

14. An artificial promoter library according to claim 
13, wherein said intervening conserved motifs are se- 
lected from the group consisting of the conserved motifs 
-44 to -41: AGTT, -40 to -36: TATTC, -15 to -14: TG, and 

15 +1 to +8: GTACTGTT. 

15. An artificial promoter library according to any one 
of claims 11-14, wherein the length of the spacer between 
the -35 signal and the -10 signal is 14-23 bp, preferably 

20 16-18 bp, and more preferably 17 bp. 

16. An artificial promoter library according to any one 
of claims 11-15, wherein the sense strands of the double 
stranded DNA fragments have the degenerated sequence 

25 shown in SEQ ID No. 1 with minor variations in the con- 
sensus sequences and spacer lengths . 

17. An artificial promoter library according to any one 
of claims 11-15, wherein the sense strands of the double 

30 stranded DNA fragments, including an upstream temperature 
regulatory DNA sequence, have the degenerated sequence 
shown in SEQ ID No. 2 with minor variations in the con- 
sensus sequences and spacer lengths. 

35 18. An artificial promoter library according to any one 
of claims 1-6, wherein the selected organism or group of 
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organisms is selected from the group consisting of eu- 
karyotes . 

19. An artificial promoter library according to claim 18 , 
5 wherein the eukaryotic organism or group of organisms is 

selected from the group consisting of eukaryotic microor- 
ganisms . 

20. An artificial promoter library according to claim 
10 19, wherein the eukaryotic microrganism or group of mi- 
croorganisms is selected from the group consisting of 
yeasts, other fungi and mammalian cell lines. 

21. An artificial promoter library according to claim 
15 20 , wherein the eukaryotic microorganism is a yeast of 

the species Saccharomyces cerevisiae. 

22. An artificial promoter library according to any one 
of claims 18-21 , wherein said consensus sequences com- 

20 prise a TATA box and at least one upstream activation se- 
quence (UAS) . 

23. An artificial promoter library according to claim 

22, wherein said consensus sequences further comprise a 
25 transcription initiation signal (TI box) functioning in 

Saccharomyces cerevisiae. 

24. An artificial promoter library according to claim 

23, wherein said consensus sequences comprise the TATA 
30 box: TATAAA, the UASQCN4p- TGACTCA , and the TI box: 

CTCTTAAGTGCAAGTGACTGCGA, which also functions as the 
binding site for the arginine repressor, argR. 

25. An artificial promoter library according to any one 
35 of claims 22-24, wherein the spacer length between the 

TATA box and the first UAS and between this and further 
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UAS, if any, is up to 1000 bp, preferably from 10 to 500 
bp, and more preferably from 15 to 150 bp, 

26. An artificial promoter library according to any one 
5 of claims 22-25, wherein the sense strands of the double 
stranded DNA fragments have the degenerated sequence 
shown in SEQ ID No. 3 with minor variations in the con- 
sensus sequences and spacer lengths . 

10 27. An artificial promoter selected from an artificial 
promoter library according to any one of claims 1-26. 

28. A promoter according to claim 27, comprising a DNA 
sequence selected from the group consisting of the SEQ 

15 IDs Nos. 5-58. 

29. An artificial promoter derived from an artificial 
promoter according to claim 27 or 28 » 

20 30. A method of constructing an artificial promoter li- 
brary for a selected organism or group of organisms, 
which comprises selecting at least two consensus se- 
quences of efficient promoters from said organism or 
group of organisms; synthesizing a mixture of single 

25 stranded DNA sequences comprising said consensus se- 
quences, or parts thereof comprising at least half of 
each, and surrounding or intermediate nucleotide se- 
quences (spacers) of variable length in which at least 7 
nucleotides are selected randomly among the nucleobases 

30 A, T, C and G; and converting the single stranded DNA se- 
quences into double stranded DNA fragments by second 
strand synthesis. 

31. A method according to claim 30, wherein at least 10, 
35 preferably at least 12, and more preferably at least 14, 
nucleotides in the spacer sequence(s) are selected ran- 
domly among the nucleobases A, T, C and G. 
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32. A method according to claim 30 or 31, wherein the 
single stranded DNA sequences include a regulatory DNA 
sequence imparting a specific regulatory feature to the 
promoters of the library. 

5 

33. A method according to claim 32, wherein the specific 
regulatory feature is activation by a change in the 
growth conditions 

10 34. A method according to any one of claims 30-34, 
wherein a sequence specifying one or more recognition 
sites for restriction endonucleases is added to one of or 
both the ends of the single stranded DNA sequences in the 
synthesis, or a linker comprising such restriction sites 

15 is ligated to one of or both the ends of the double 
stranded DNA fragments. 

35. A method according to claim 34, wherein a sequence 
specifying multiple recognition sites for restriction en- 

20 donucleases is added to one of or both the ends of the 
single stranded DNA sequences in the synthesis, or a 
linker comprising such multiple cloning sites (MCS) is 
ligated to one of or both the ends of the double stranded 
DNA fragments . ' 

25 

36. A method according to any one of claims 30-35, 
wherein the selected organism or group of organisms is 
selected from the group consisting of prokaryotes. 

30 37. A method according to claim 36, wherein the prokary- 
otic microrganism or group of microorganisms is selected 
from the group consisting of bacteria belonging to the 
genera Lactococcus, Escherichia, Bacillus and Pseudo- 
monas. 

35 

38. A method according to claim 37, wherein the prokary- 
otic microorganism is selected from the group consisting 
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of Lactococcus lactis, Escherichia coli, Bacillus sub- 
tilis and Pseudomonas putida. 

39. An artificial promoter library according to any one 
5 of claims 36-38 , wherein said consensus sequences com- 
prise the -10 signal (-12 to -7): TATAAT and at least one 
activator protein binding site upstream of the -10 signal 
or parts thereof comprising at least 3 conserved nucleo- 
tides of each, 

10 

40. A method according to any one of claims 36-38, 
wherein said consensus sequences comprise the -35 signal 
(-35 to -30): TTGACA and the -10 signal (-12 to -7); 
TATAAT or parts of both comprising at least 3 conserved 

15 nucleotides of each. 

41. A method according to claim 40, wherein said consen- 
sus sequences comprise from 4 to 6 conserved nucleotides 
of the -35 signal and from 4 to 6 conserved nucleotides 

20 of the -10 signal, preferably 5 or 6 , and more preferably 
all 6 nucleotides of each. 

42. A method according to any one of claims 39-41, 
wherein said consensus sequences further comprise inter- 

25 vening conserved motifs. 

43. A method according to claim 42, wherein said inter- 
vening conserved motifs are selected from the group con- 
sisting of the conserved motifs -44 to -41: AGTT, -40 to 

30 -36: TATTC, -15 to -14: TG, and +1 to +8: GTACTGTT . 

44. A method according to any one of claims 40-43, 
wherein the length of the spacer between the -35 signal 
and the -10 signal is 14-23 bp, preferably 16-18 bp, and 

35 more preferably 17 bp. 
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45. A method according to any one of claims 40-44, 
wherein the mixture of single stranded DNA sequences has 
the degenerated sequence shown in SEQ ID No. 1 with minor 
variations in the consensus sequences and spacer lengths. 

5 

46. A method according to any one of claims 40-42 and 
44, wherein the mixture of single stranded DNA sequences 
including an upstream temperature regulatory DNA sequence 
has the degenerated sequence shown in SEQ ID No. 2 with 

10 minor variations in the consensus sequences and spacer 
lengths . 

47. A method according to any one of claims 30-35, 
wherein the selected organism or group of organisms is 

15 selected from the group consisting of eukaryotes. 

48. A method according to claim 47, wherein the eukary- 
otic organism or group of organisms is selected from the 
group consisting of eukaryotic microorganisms. 

20 

49. A method according to claim 48, wherein the eukary- 
otic microorganism or group of microorganisms is selected 
from the group consisting of yeasts, other fungi and mam- 
malian cell lines. 

25 

50. A method according to claim 49, wherein the eukary- 
otic microorganism is a yeast of the species Saccharomy- 
ces cerevisiae. 

30 51. A method according to any one of claims 47-50, 
wherein said consensus sequences comprise a TATA box and 
at least one upstream activation sequence (UAS). 

52. A method according to claim 51, wherein said consen- 
35 sus sequences further comprise a transcription initiation 
signal (TI box) functioning in Saccharowyces cerevisiae. 
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53. A method according to claim 52, wherein said consen- 
sus sequences comprise 

the TATA box: TAT AAA, the UAScCN4p* TGACTCA , and the TI 
box: CTCTTAAGTGCAAGTGACTGCGA, which also functions as the 
5 binding site for the arginine repressor , argR. 

54. A method according to any one of claims 51-53, 
wherein the spacer length between the TATA box and the 
first UAS and between this and further UAS, if any, is up 

10 to 1000 bp, preferably from 10 to 500 bp, and more pref- 
erably from 15 to 150 bp. 

55. A method according to any one of claims 51-54, 
wherein the sense strands of the double stranded DNA 

15 fragments have the degenerated sequence shown in SEQ ID 
No. 3 with minor variations in the consensus sequences 
and spacer lengths. 

56. A method of optimizing the expression of a gene in a 
20 microorganism, which comprises 

a) selecting a set of promoters covering a range of pro- 
moter activities in relatively small steps of activity 
change from an artificial promoter library according to 

25 any one of claims 1-26 or constructed by the method ac- 
cording to any one of claims 30-55; 

b) cloning said set of promoters into said organism plac- 
ing in each clone said gene under the control of at least 

30 one promoter from the set; 

c) growing the selected clones and screening them to find 
the one showing optimized flux of product formation. 
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57. A method according to claim 56 , wherein the selected 
organism is selected from the group consisting of pro- 
karyotic and eukaryotic microorganisms. 
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